Picture for Xiaodong Gu

Xiaodong Gu

HEART-Bench: Do LLM Agents Exhibit Human-like Psychology?

Add code
May 28, 2026
Viaarxiv icon

Reward-Decomposed Reinforcement Learning for Immersive Video Role-Playing

Add code
May 06, 2026
Viaarxiv icon

ShredBench: Evaluating the Semantic Reasoning Capabilities of Multimodal LLMs in Document Reconstruction

Add code
Apr 26, 2026
Viaarxiv icon

EffiSkill: Agent Skill Based Automated Code Efficiency Optimization

Add code
Mar 29, 2026
Viaarxiv icon

Rethinking the Value of Agent-Generated Tests for LLM-Based Software Engineering Agents

Add code
Feb 08, 2026
Viaarxiv icon

CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding

Add code
Feb 02, 2026
Viaarxiv icon

SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents

Add code
Jan 23, 2026
Viaarxiv icon

Reasoning in Trees: Improving Retrieval-Augmented Generation for Multi-Hop Question Answering

Add code
Jan 16, 2026
Viaarxiv icon

GlimpRouter: Efficient Collaborative Inference by Glimpsing One Token of Thoughts

Add code
Jan 08, 2026
Viaarxiv icon

In Line with Context: Repository-Level Code Generation via Context Inlining

Add code
Jan 01, 2026
Viaarxiv icon